DOCOHEHT RES CHE 



ED 206 509 * SE 035 615 

AUTHOR r . Ganiel, Ori: Hbfstein, Avi v 

TITLE Objective ana Continuous Assessment of student 

Performance in the Physics Laboratory, 
PttB DATE Apr 8.1 . * * 

NOTE 23p.: Paper presented at the Annual Heeting of the 
: National Association for Research in Science Teaching 

(5Hth, Grossinger's in the Catskills, Ellenville, NY, 

April 5-8, 1981) . 

EDRS PRICE HF01/PC01. Plus Postage. 

DESCRIPTOPS Classroom Observation Techniques; Comparative 
- . . Analysis: *Performance Tests: Physics; Science 

Education: *Science Laboratories; Secondary 
Education: Secondary School Science; *Skill 
Development: ^Student Evaluation; Teacher Attitudes; 
♦Test Construction; videotape Recordings 
IDENTIFIERS *Science Education Research • 

ABSTRACT 

Results of a study conducted to dlvelop and validate 
an instrument for oblective and 'continuous assessment of student 
performance in the physics laboratory are* reported. Two modes of \ 
teacher assessment of student performance were compared: (1) 
video-tape recording of two students performing experiments; and (2) 
an instrument for continuous, objective assessment of student 
'pejrf ormancfe. Results include findings that teachers using the- 
instrument considered all student skills and that greater uniformity 
in assessment was achieved. An attitude assessment revealed that 
^eachers demonstrated a high .positive attitude toward use of the * 
instrument.' (A&hor/CSl 



****************************************************** ****^^^ 

* Reproductions supplied by EDRS are the\best that can ba made . * *. 

* from the original document. * 

4c*** ***** **^c* *************************** ****^****^*****<*4 t **** *********** 

9 



1/.S. DEPARTMENT OF EDUCATION 
NATIONAL INSTITUTE OF EDUCATION 
EDUCATIONAL RESOURCES INFORMATION 
CENTER <ERJC) * 
P( This document has been reproduced -as 
received . from, the person or organization 
originating it » 
Minor changgs have been made to improve 
reproductiorvo^ialitv 

• Points of view or opinions stated in this docu 
mem do not necessarily represent officii NlE 
position or policy 



"PERMISSION TO REPRODUCE THIS 
MATERIAL HAS BEEN GRANTED BY 

( in Crafii e\ 



TO THE EDUCATIONAL RESOURCES ' 
INFORMATION CENTER (ERIC)." 



v 



• OBJECTIVE AND CONTINUOUS ASSESSMENT OF 
STUDENT PERFORMANCE IN THE PHYSICS LABORATORY 



Uri Ganiel 
and 

Avi Hof stein 



Department of Science Teaching • 

■9 

The Weizmann Institute of Science 
Rehovot 76100, ISRAEL * 



■ OBJECTIVE AND CONTINUOUS ASSESSMENT Of 
STUDENT PERFORMANCE IN THE PHYSICS LABORATORY 



I. INTRODUCTION * ' , . \ 

The laboratory has'long been used in science instruction to involve 

• students # ih concrete experiences with objects and concepts. 

With the advance -of heto curricula in the 60's.and 70*5. which stress 

* the process of ^science- and emphasize the development of higher cognitive 

- • v 

skills, thef laboratory has acquired a central role. Some science educators 1 

even went as far as regarding the laboratory as^ M ... not just as a place for 

demonstration and confirmation but rather as the core of £tre' science learning 

f ' • ' - w 

process"* (Shulman and Tamir, 1973). Once th^'centra]^ role of the laboratory 

in science instruction is recognized, the problem of assessment of student. 
> 

performance in the labbratory becomes one of the utmost importance . 
Types of assessment of laboratory work 

Several different styles^f assessment of student laboratory skills have 
been utilized. The more common style is based on written evidence, namely * * 
written reports or items on paper and pencil tests- (Kruglak, 1958, Gnobman, 
1970, Doran, 1978, Giddings and Hdfstein, 1980, Lunetta et al , 1981). 
However, since practical work involves abilities^, both manual and Intel lectual , 
which are in some measure distinct from those u§£d in non practical work 
(Kelly and Lister, 1969), the assessment of students during practical sessions 
^should not be overlooked. This argument is also supported by research* 

evidence.. It has been short!) that the correlation between students 1 achievement 
in practical tests and their achievement based on written evidence is rather , 
low". (Robinson, 1969,' Buckley, 1970, Tamir, 1972, Ben Zvi et al . , ]977h 



Thus, there is a need- to develop special measures to assess not only what 
the student .knows about the laboratory but also what he actually does in 
the laboratory. , " • * * 

Manual skills* and resourcefulness in dealing with an experiment need to be 

■\ x __ . ^* ' 

assessed in an actual laboratory situation. Practical examinations were * 

v • . * 

used in several research studies by Tamir (1974) imbiblogy, and by Eglen * 

and Kempa (1974) and Ben-rZvi et al (1$77)' in chemistry. Thfese practical 

examinations utilized systematic observations based on a list o,f specific \ 

criteria, as opposed to an open ended, subjecti ve-iJype assessment, which 

is very common amongst teachers and examiners. 

During the school year, even those teac/iens who employ practical work 

'as part 'of their teaching, tend to stay away from practical 'examinations - 

since their implementation is. time consuming and difficult. Practical 

examinations are also in use by several Boards of Examination 'and in final 

> 

examinations S'n several countrjes (e.g. A-level practical examinations in 
the U.K. and the matriculation in Israel). In these examinations, students 
are usually examined by external examfners 3rui not, by their own teachers. 
Such examinations suffer from several drawbacks: , 1 * 

a. In many cases different examiners use different criteria to assess 
student performance. 

b. They are limited to those experiments tha,t can be readily administered to 
students during a limited time period. This obviously restricts ^boih^the 
scope and validity of the assessments x 1 4 

c. Since such examinations are difficult to implement, they Cannot be ; 
conducted very often, Consequently the element of .chance is ^rather 
dominant. This obviously increases anxiety on behalf of the student. 
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d. Because of administrative constraints, practical examinations 1 will often 
be administered to a large^group of students simultaneously. Consequently , 
•the examiner w>l] not be able to concentrate on observing each student 
systematically, and will have to rely in his assessment on the results of 
—the experiment and orrrthe^ritten reports. *' ~ ~ ~ 

In an attempt to overcome these drawbacks, there has been in recent years 
a mpvement towards implementation of continuous assessment by the teacher in 
normal laboratory sessions. This has been formalized to some degree in the' 
'United Kingdom (University pf London, 1977, J.M.B., 1979). In- this system of 
continuous assessment, the teacher unobtrusively observes each student during 
normal lab activities and rates h<im or her on specific criteria. The assess- 
ments can be recorded for each student after observations over an extended 
period of time. Normally only a few students will be carefully observed and 
rated during each activity. 

Continuous assessment on several occasions throughout the year is necessary 
to adequately' cpver the variety of* tasks and skills which comprise a total v v 
program of practical work. 

Purpose of Study ' •* 

The present study was undertaken -in order tojJevelop and implement an 
Instrument for the assessment of the performance of hig^School students in 
the physics laboratory, in a continuous manner. Once*^uch an instrjum^nt was 
available and was being used, ° we wanted to find out whether its use does 
indeed improve the assessment of students by their teachers. By improvement 
we mean ap increase in precision / in the sense of achieving a better consensus 
amongst teachers when evaluating a N particular student. Secondly, we hoped 
to increase their .objectivity^ , when assessing different students who .may differ 
in their abilities in various components" of the performance in the physics 
laboratory* Clearly, in such a situation, the personal preferences and biases 



of each teacher pTay a major role,' and 'different teachers may tend' to 
emphasize diffejfent components in their" assessment/ By convincing all teachers 
to use the same criteria, one expects to increase the objectivity of their 
assessment. # 

RESEARCH METHOD • '< . ' k 

Population ^ 

150 physics teachers in Israeli .high schools (grades 9-12) were contacted 
and asked to participate fn the experiment. All were- experienced teachers, 
'Who have ^taught physics in high'school for a number of years and- performed 
part of their instruction in the. 4 abira'tory , About half of these teachers 
agreed to participate and out of those 25 were randomly chosen. These 
constituted the group with which this project was undertaken. 

> 

Research Instruments 

~ 5 " * 
a. Video .Films and students 1 reports " r^-^ 

' • * t * 

As a preliminary stage, it was necessary to create a situation in which 

. ' all the participants would be-abl-e to observe and assess the same 

• laboratory exercises, performed by the same students. In considering the 

, ■ various possibilities to achieve this aim, we" tried to satisfy the 

^ fo^lowincj criteria as closely as- possible^ 

('1) Identical condition for* the assessment (i.e. all teachers should 

- watch the performance of the same student performing a lat> exercise) 

(2) Good conditions! for assessment (every teacher should be able to watch 

the student closely). , 4 * * 

• j ■ 

.(3) "Natural" conditions (the student should work in an environment similar 
to that of a regular lab session in school). - 
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We therefore chose to make video films, in each of which a student was 
shown while performing a physics experiment in a* school laboratory all by 
himself. This; does not fully simulate the situation in school, where 
students' normally work in pairs.or even in larger groups, but except fori 
that, this method was* adequate with regard to our requirements-. Three such 
films were made. The experiments were taken from the. regular physics course 

(Physics Experiment for High School, 1975) taught in Israeli high schools 

v * ' 

(junior and senior years:). We chose experiments with which alj teachers 

were familiar, and which contain many elements which need-to'be assessed. 

The students' teachers were consulted ^so that each of the students performed 

an experiment he had not done.'before, but for which he had the necessary 

theoretical preparation- -'Each student receiYed the relevant physics text 

book and the. lab guide which describes the 'experiment ar^d explains what^he 

was expected ta do. All the equipment necessary. was available on the bench, 

and a teacher was on hand to answer any v questions he wanted to ask before 

he started to work. The student was asked to talk while working and explain 

what he was doing, much in the same manner as he would talk to his partners 

in a regular/ x lab session in school. % 

t After performing* the experiment and recording his data, the student was * 

asked to write a lab report. These reports, in the students' handwriting, 

were photographed and later used by the teachers in their assessment.. . 

b. Assessment Scheme , . . 

> • — ■ — 

The central part of the investigation consisted of testing the use of a 
common list of .criteria for assessment by all the teachers/ 

We surveyed the literajune for Similar 'lists , both' in physics and in 
other natural sciences (Mathews, '1969 > Tamir and Glassman, 1971 J.M.B. , 1979) 
A questionnaire was distributed amongst Physics, teachers in^Israel and in it 
they were asked for their opinions about the importance of variou's^fcomponents 

• 7 • • • " ' ' 
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of the work in the physics laboratory. ' • . 

/ • * 

After analyzing the answers received, we interviewed a selected group 
of experienced teachers ami discussed the opinions reflected in these 

answers. Utilizing all the above information a list was made up, which 
consisted of the following five components: 

1. Constructing the experimental set up and manipulative skills. 

2. Observations and measurements. 

• . ■ * 

" 3. Ordered and organize^ work. ^\ 

4. ' Organization and processing of data (including graphs). - 

5. Drawing conclusions and critical discussion. 

Under each of these five headings, there is a more detailed list, which 

* » . * * 

contains the component-skills belonging to this heading. The detailed list 

is given in Appendix *1. 

Components 1, 2, 3 refer to practical skills, i.e. those 'which have to do 

« or 

with manipulation and direct performance during the experiment. Under 4 and 

5 are the cognitive- type ski Hs^ which connect the laboratory performance to 

* - 

the theoretical thinking and drawing of conclusions , and depend on the 
* 

♦ . 

- ability to process information, digest it and utilize it in the learning 
process. 

'/ ' 
The laboratory experience should combine all these elements ; in order 

- * f - 

to be a worthwhile activity in the science teaching process, and it is 

therefore imperative that all of them be ^assessed. 
Procedure '* ' 

/, 

The teachers who participated in the project met twice. The first meeting 

consisted of two par^s. First, the teachers watched two of the films; Before 

watching af film, the participants received the instructions which wefe given 

to the student, whose performance they were about to watch. They were also 

shown the equipment and apparatus used by the student in the experiment. 



ERLC ' 8 



After watching the film, each participant received a copy of the original lab 
report submitted by the student in his handwriting. Each teacher was asked 
to assess the student's overall performance (the performance of the actual 
experiment. y and the lab report)*as he would usually do in .his < everyday teaching. 
In what follows we shall refer tp this grade as the grade in the "subjective" *• 
(S) assessment. • - 

In the second part of the first meeting, the teachers received the check- 
list we' have developed^ The structure of this list and the method of its use 
were explained. In order "to trai'n them in the use of this instrument, the 
teachers were then shown a third film, and given the student's' lab report. 
Tfiey were asked to assess the stifdent's performance., this time using the list*. 
Each of the teachers was asked to start using the instrument in his regular 
laboratory 'lessons. 

During the four months following the first meeting, the teachers used 
the list to cissess their students' performance in' the physics laboratory in 
their schools.. m , 

At the end, of this period, a second meeting took place, and 21 of the 

; -J 
25 teachers who started the project took part. In this meeting the teachers . 

were shown the same two filmswhich they had seen in the first meeting, and 

the same preliminary procedure was followed (i.e. they, were given the instructions 

» 

for the student and could check the equipment he had used). After watching 
each filfl, the^ again received the lab Report, and were asked to assess the 
student's performance. This time, however, they were requested to use the 
instrument, to which they were by now Accustomed. Each teacher was asked to 
assign five grades to the student,' one grade for each of the five components 
of the list. The teachers were asked to state what weights would be adequate 
in their opinion for each component (1-5) in the list. The average (over all 
teachers) for each of the five components- was computed and rounded off. 
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Since there" was, in general, good agreement between the teachers on this 
j point, we considered use of the avera'ge to be the best choice. The grade 
each of the student's obtained in this manner, will be referred to as the 

"objective" (0) grade.. J 

At the end of the project, the teachers- who participated "were asked to 
express their oqjnions about the use of the proposed assessment scheme in 
their physics teachirrg. 

«, • 

III. RESULTS AND DISCUSSION 

• As described in the previous section, the data collected and used in our 

V 

analysis can be summarized as follows: 

Each of the 21 teachers gave a "subjective (S) grade, to student A and to 

r * 

student B. Then, each teacher assigned 5 grades (Op Og, Og) to each 
student, for the five components appearing in the list. The weighted average 

o'f these constituted the "objective 11 (0) grade for*each student. 

x 

In Table I, the means and standard deviations over all 21 teachers are 

r 

presented. • . . 



Insert Tabljs I about here * 



ERLC 



These data will be discussed in reference to the objectives of our study. 
Increase in precision 

/ As already stated, we aim for ,ah improvement in the consensus among 
teachers, when 'assessing the same student. From the data in Table I it is 
seen, that even though the change in. the mean grade (from S to 0) is moderate, * 
there is a substantial decrease in the standard deviations for both student A 
and student B. '"Morgajri" tests (Morgan, 1939) showed this decrease to be 
significant in both cases.** I.f regard the mean as our best approximation 
for the "correct" grade, then more teachers came closer to grading the student 
correctly in the objective mocte of assessment. This improvement in precision 



■ * v 

*" ' . - > ' 

is undoubtedly due to the use of well-defined criteria 1 by apl 1 the teachers, 

. . > ... 

_ rather than grading the student subjectively accordijm to their own preferences. 

Increase in Objectivity m . ' * 

; ' it . : 

A second gspect of student assessment concerns objectivity. 1/ a teacher 

considers in his assessment some aspects of student .performance and disregards 

others, he will obviously grade highly a student who performs well in the 

aspects he considers. Another student, who may even perform well in some 

^aspects this teacher does not consider, will receive a low grade, if his 

performance is weaker on the skills the teacher does consider. Jhrs situation 

, (or similar ones with different combinations of student skill vfc. teacher 

preferred aspects of performance) clearly occurs in classroom situations, in 

♦ 

which teachers have to evaluate different. students on different occasions. 

By using a well defined list of criteria, every teacher is forced to 
consider all .aspects of student performance which are included in the -list. 
Clearly, some teachers are more lenient thanothers, but this leniency (or 
- lack of it) should manifest. itself .in all the aspects which they assess. 
Hence, objectivity should ^manifest itself in tbat .the "grades teachers give 
to different students will be highly co rr elite d. • We computed the correlation 
(Pearson r) between the setfc of scores obtained by student A and by student B, 
both for the subjective (S) and the objective (0)' modes of assessment. 
For the S scores we obtained r = 0.36 $ = 0.11) while for the 0 scores 
we find r = 0.70 (p = 0.0004). 
* These results are also clear from Figs. 1 and 2, where w'e plot each score < 
received by student A vs. the score received by student B from the same teacher. 



• Insert Figures 1 and /2 about hefe 

*, : 
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The results confirm the hypothesis that by using assessment schedules 
which were'based on ready made criteria, teachers were forced to assess 
students much more objectively: they assigned grades to each of the abilities 
nwh+eh-shoulcf be considered, and consequently became much more consistent in 
their overall assessment of stud^t achievement in the physics laboratory. 

Teacher's Attitudes 

When trying to introduce a new assessment scheme into schools, teachers 1 
attitqdes and their willingness to adopt it should' not be overlooked. In 
order to evaluate this aspect a 16 item Lykert type questionnaire was • * 
administered to the 25 teachers who started this project. In this questionnaire 
the teachers were asked to express their opinions , about the usability of the 
list, whether they thought it helped them to arrive at a better -assessment" of 
their students, and whether they would continue using it in the future. 
It was found, that teachers agreed that the use of the list greatly improved 
the quality of their assessment. However, the 4 teachers who'did not take 
part .in the second part of the experiment (the objective assessment) did not 
.use it in their classrooms. They agreed that the list should be used in the 
final practical- examinations , to ensure precision and objectivity, but they 
found jt difficult to use it continuously in their regular teaching frt school. 

We also interviewed all the teachers at the end of the project, and they 
expressed their opinions about the assessment scheme, ~Sop*e Qf £he remarks 
made by teachers during these interviews are quoted below: ^ 

- n ,,, presently, the criteria (for assessment) are totally vague, and such 
a list is an absolute must.,," 

- one single experiment and the impression of it (in the matricujation 
examination) cannot come instead of continuous observations by the teacher 
during the year" . 



- "... if in the first meeting [i.e. where the assessment was done subjectively, 

1 

without the* list) there was such a wide distribution, of grades, based on 

all kinds of arguments, then the use of such a list should be made compulsory... 

- "... I had some difficulties using the list, so .1 posted it at the entrance 
to the lab. My students could read it and check which of the stalls (in the 
list) is neces-sary in the experiment they were performing. I thi nk this was 
even more important than the grades J gave them at the end." 

IV. CONCLUSIONS J ; • m . ' 

t 

For many years, science educators have ifecoghized.the importance of the 
laboratory in science teaching and learning! If one agrees that the laboratory 
is a unique mode of instruction r then the assessment of this mode should not be 
v o!veMookec[ both by teachers and by those involved- in teacher training. Assessment 
of laboratory skills, which is based exclusively on written evidence',* will % 
/necessarily neg]ect those skills which manifest themselves during the actual 
performance of the laboratory exercise. Even practical examinations have 
severe drawbacks, if they are used only~off special occasions, such as in final 
examinations. Furthermore, practical examinations in which the assessment is 
not based on clearly defined criteria arenot very useful, since their outcome 
will be greatly influenced by personal preferences andTnases of the particular 
examiner. 

We have tried out an assessment scheme .which is based on a clearly defined 
list of criteria. It was found that by using this procedure, the assessment of 
^ laboratory performance was greatly improved, bpth in precision £nd in objectivity. 
Furthermore, this/scheme is adequate for continuous assessment by* the teacher 



fns/s 
Dpn di 



\ 

in his classroortf during the school, y$ar,%and therefore offers a remedy to most 



of the disadvantages discussed abcy/e. 
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Finally, we feel that those Involved in teacher trailing should realize 
that there is a need^to incorporate such methods of evaluation of student 
performance in the laboratory into their training programs, so that in the 
future it will be used by the teachers as part of their physics instruction. 



\ 
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Appendix 1 . , , 
ASSESSMENT LIST 

1. Constructing the experimental set-up and manipulative skills (15%)* 

- identifies the components of an experimental set-up from a 
schematic diagram. • 

- constructs the experimental set-up' according to a scheme 
^ or to instructions. 

- uses the correct equipment for^each measurement or observation. 

- overcomes simple malfunction of equipment. - 

2.. Observations and measurements (25%)* 

- reads the instruments correctly (scale, .parallax, etc.) 

- performs observations and nieasurements correctly. 

.- records the results (of observations and measurements) clearly. 

3. Ordered and Organized Work (10%)* 
*- arranges the equipment on the bench iri an orderly manner. 

- keeps tidy and orderly rac^ds in the notebook.- , ^ 

> - observes safety regulations. ^. 

f completes the prescribed task in the time allotted. 

- concentrates, on his work and does not disturb his fellow students. 

J ' . 4> Organization" and processing of data^( including graphs) ' (25%)* 

- processes the results of observations correctly. 

- organizes tables of measured data efficiently and clearly. 

- records results of measurements with correct units and * 
significant digits . \ , 

- constructs graphs correctly (scales ,* mark data'points in 
coordinate system, best line through data points, ...) 

. % - obtains numerical values- from graphs. 

- chooses correct formulae Tor' calculating unknown variables. 

' IS *. • • 
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- substitutes data correctly in formulae. 

- performs computations correctly* 

Drawing conclusions and Critical discussion ^ (25%)* 

- understands the objectives of an experiment, and the connection 
between them and the experimental procedure of its execution. 

e 4 • 

- decides .how. many measurements are necessary in ordSr to find 

» * 

relations between variables. 

* « 

- identifies the s^pe of a graph and use it to find a functional 



dependence between variables. • - • 

discovers and identifies reasons for illogical results, 
identifies reasons for improper performance of an experimental system, 
understands the limitations of an experiment, and tries' to find ways 
for improvement. . f 

formulates new assumptions in view of experimental results. 



f percentage of the component in the total gra^e. 
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Means and Standard Deviations 
• of teachers 1 grades (N = 21) 





Student A 


Student B 




Average , 


' Std. Dev." 


" Average 


Std. Dev. 


s 


7.60 


1.23 * 


7.05 


Ir04 ** 


°1 


8.79 


0.90 


7.60 


1.24 


■ °2 


8.40 


0.89 


4 

•6.95 


0.92 


d 3 


8.62. 


0.91 


7.31 


1.05 


°4 


6.90 


1.30 


6.38 


1.07 


' °5 


7.33 


1.29 


6.14 


1.01 


0 


' .7.84* 


0.81 * 


* • 6.74 


0.64 ** 



* Morgan test- : t 1 = 2.15 ' p = 0.02 
** Morgan test : t = 3.04 • p = 0.003 
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Figure Captions • — * • 

' ' * i 

/' I 

Figure 1 grades received by student A versus grades received by 

„, ► 

student B - subjective (S) assessment. 

Figure 2 grades received by student A versus grades received by * 
. student B - objective (0) assessment. 
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